Skip to main content

Reply to this post | Go Back
View Post [edit]

Poster: aibek Date: Dec 28, 2012 8:27am
Forum: texts Subject: Try pdfcrop before printing PDF files

Try Heiko Oberdiek’s ‘pdfcrop’ before printing the Internet Archive PDF files. It does a wonderful job of cutting out excessive white margins (automatically). This makes the fonts larger, and the pages much more attractive.

https://code.google.com/p/pdfcrop2/
http://code.google.com/p/pdfcrop2/source/browse/trunk/pdfcrop.pl

The software is included in the Debian and Ubuntu ‘texlive-extra-utils’ package.

You can also try another ‘pdfcrop’, and ‘pdfnup’. (See the links.)
http://pdfcrop.sourceforge.net/
http://vince-debian.blogspot.com/2008/09/pdfcrop-vs-pdfnup-trim.html

---
Update [2012-12-28]: Both the above pdfcrops call ghostscript to calculate a tight BoundingBox for every page. The first one then works using pdflatex, the second using pdfedit.


This post was modified by aibek on 2012-12-28 16:27:03

Reply to this post
Reply [edit]

Poster: ecossais Date: Apr 27, 2012 3:47am
Forum: texts Subject: Re: Try pdfcrop before printing PDF files

Thanks very much, I must have a look at those.

My personal favourite is 'briss'. An extremely simple program that allows one to graphically, manually chop off margins. I use it to crop PDFs before putting them on an ebook reader :) It works on Windows and Linux. I think there are screenshots

http://sourceforge.net/projects/briss/

Reply to this post
Reply [edit]

Poster: aibek Date: Dec 15, 2012 11:32pm
Forum: texts Subject: Also check unpaper

http://unpaper.berlios.de/

unpaper is a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies. The main purpose is to make scanned book pages better readable on screen after conversion to PDF. Additionally, unpaper might be useful to enhance the quality of scanned pages before performing optical character recognition (OCR). unpaper tries to clean scanned images by removing dark edges that appeared through scanning or copying on areas outside the actual page content (e.g. dark areas between the left-hand-side and the right-hand-side of a double-sided book-page scan). The program also tries to detect disaligned centering and rotation of pages and will automatically straighten each page by rotating it to the correct angle. … It is always a good idea to manually control the results of unpaper and adjust the parameter settings according to the requirements of the input. Each processing step can also be disabled individually for each sheet. Input and output files can be in either .pbm , .pgm or .ppm format … Conversion to PDF can e.g. be achieved with the Linux tools pgm2tiff, tiffcp and tiff2pdf.

Reply to this post
Reply [edit]

Poster: aibek Date: Dec 28, 2012 8:33am
Forum: texts Subject: Re: Try pdfcrop before printing PDF files

Thanks for your additions, martyveldman and ecossais.

There are many scanned books with pages having text to the left and right alternatively. Here pdfcrop is very useful as it estimates the minimum area for each page separately, and yet outputs the whole thing in one pdf file.

There is this though: pdfcrop 1.20 apparently takes only standard paper sizes. In the Internet Archive books, I have to first change the paper size to A4 (via "Print as PDF" feature of PDF readers).

---
Update [2012-12-28]: It has nothing to do with paper-sizes. Something else is wrong, which causes failure of pdftex (which pdfcrop uses for page inclusion) with the error: `PDF inclusion: invalid font in reference type <dictionary>'. (This happens for many, but not all IA pdf files.) For the cause of this failure, and solution, please search for another post by me.

An incomplete post is here: http://archive.org/post/464880/
---

The second pdfcrop above (Eric Doviak’s) should not have problem with IA files. I have not tried Doviak’s pdfcrop, but the files pdfedit produces are like the IA files (i.e., pdfedit files have exactly the same issue as the IA files), so pdfedit should have no problem with processing IA files.

This post was modified by aibek on 2012-12-28 16:33:33

Reply to this post
Reply [edit]

Poster: martyveldman Date: Apr 15, 2012 2:08pm
Forum: texts Subject: Re: Try pdfcrop before printing PDF files

Foxit Reader does the same thing by adjusting Page Scaling. It's faster than Adobe PDF Reader and takes up a lot less space: 161 MB vs 10.7 MB.

http://www.foxitsoftware.com/Secure_PDF_Reader/

This post was modified by martyveldman on 2012-04-15 21:08:35

Reply to this post
Reply [edit]

Poster: aibek Date: Dec 28, 2012 7:51am
Forum: texts Subject: Re: Try pdfcrop before printing PDF files

Also check unpaper.
http://unpaper.berlios.de/

---
Update [2012-12-28]: Page scaling, translation, rotation and cropping can easily be done by pdfedit. (open source and free; available for GNU/Linux and Windows)
http://pdfedit.cz/en/user_doc.html

This post was modified by aibek on 2012-12-28 15:51:35

Reply to this post
Reply [edit]

Poster: garthus1 Date: Dec 28, 2012 8:46pm
Forum: texts Subject: Re: Try pdfcrop before printing PDF files

I no longer use Adobe reader on any customer machines. Foxit is much better. Significantly.

Gerry